Bayesian inference of the resonance content of $p(\gamma, K^+)\Lambda$

A Bayesian analysis of the world's $p(\gamma, K^+)\Lambda$ data is presented. From the proposed selection
of 11 resonances, we find that the following nucleon resonances have the highest probability of
contributing to the reaction: $S_{11}(1535)$, $S_{11}(1650)$, $F_{15}(1680)$, $P_{13}(1720)$, $D_{13}(1900)$,
$P_{13}(1900)$, $P_{11}(1900)$, and $F_{15}(2000)$. We adopt a Regge-plus-resonance framework featuring
consistent couplings for nucleon resonances up to spin $J = 5/2$. We evaluate all possible
combinations of 11 candidate resonances. The best model is selected from the 2048 model variants
by calculating the Bayesian evidence values against the world's $p(\gamma, K^+)\Lambda$ data.

PACS numbers: 14.20.Gk, 14.40.Df, 11.55.Jy 



A thorough knowledge of the nucleon-resonance ($N^*$) content of open-strangeness production
reactions could dramatically improve our understanding of the nucleon's structure. Indeed, it
provides a test-bed for the predicted $N^*$ spectra from competing baryon models [1, 2]. Despite
being the subject of numerous analyses, the set of $N^*$s that contribute to $p(\gamma, K^+)\Lambda$
is not unambiguously determined. The Particle Data Group (PDG) [3] lists four resonances with
fair evidence of existence in the $K^+\Lambda$ decay channel. Of these, only the $S_{11}(1650)$ has a
three-star status, which corresponds to a very likely contribution to this channel [3]. This is
reflected in the often contradictory outcomes of the different analyses on which the PDG ratings
are based. This disparity is illustrated in Table I. The persistent lack of consensus, despite
the increasing availability of $p(\gamma, K^+)\Lambda$ data, can be attributed in part to the
important role played by non-resonant dynamics.



Table I. The sets of $N^*$s included in various pseudoscalar meson photoproduction analyses,
compared to the results of this work. The nomenclature $L_{2I,2J}(M_{N^*})$ is used, where $L$ is
the orbital angular momentum of the $\pi N$ partial wave, $I$ is the isospin, $J$ is the spin and
$M_{N^*}$ is the mass of the resonance. The overall PDG ratings are given for each $N^*$. Missing
states predicted by constituent quark models are denoted with m. Along with the $P_{11}(1440)$ and
$D_{13}(1520)$, the $N^*$s with $J \geq 7/2$ are not considered in this work.

The criterion to determine whether a resonance contributes significantly or insignificantly
varies among different analyses. In this Letter, we wish to address this issue in a statistically
solid way, using Bayesian inference. The past decade has seen the development of advanced
coupled-channels (CC) models [4, 5, 7, 12, 13]. The effect of channel openings has been identified
as playing an important role in the reaction dynamics [14]. As Bayesian inference requires
non-trivial numerical computations in the parameter space, to date it can only be done in an
efficient single-channel reaction model, which does not capture the full complexity of CC models.
We perform an analysis using a set of nucleon resonances that are likely to contribute to
$p(\gamma, K^+)\Lambda$, within the Regge-plus-resonance (RPR) model [9, 15, 16]. The RPR model is
devised as a unified description of both the high-energy region ($\sqrt{s} > 2.5$ GeV), where the
differential cross section is forward peaked, and the resonance region ($\sqrt{s} < 2.5$ GeV). In
the RPR approach, the high-energy region is described using a Regge model. It is based on the
exchange of the $K^+(494)$ and $K^{*+}(892)$ trajectories in the $t$-channel and is parametrized by
three coupling strengths and two phases [17]. In the resonance region, the Regge model provides a
fair description of the elusive background. By coherently adding the $s$-channel
nucleon-resonance contributions in this energy region, one obtains a description of the
electromagnetic $K^+\Lambda$ production process for photon energies from threshold up to
16 GeV [9].

Our formalism makes use of the recently suggested consistent couplings for resonances with
$J = 3/2$ and $J = 5/2$ [18]. This means that all spurious degrees of freedom due to the
lower-spin components are removed from the $J \geq 3/2$ propagators. In addition, the couplings of
every resonance with $J \geq 3/2$ can be described by a mere two parameters. A spin-dependent
multidipole-Gauss hadronic form factor (FF) [18] is employed to regularize the resonance
contributions beyond the $N^*$ pole ($\sqrt{s} > M_{N^*}$). In order to minimize the number of
parameters, we adopt one common cut-off value for the hadronic FF for all $N^*$s.

The reggeized background is constrained using photoproduction data above the resonance region.
In pion photoproduction [19] the resonance region extends to $\sqrt{s} \sim 2.5$ GeV. In previous
work [20-23], the 72 $p(\gamma, K^+)\Lambda$ data points with $\sqrt{s} > 3$ GeV were employed to
determine the parameters of the reggeized background. The CLAS collaboration has recently
published $p(\gamma, K^+)\Lambda$ data for $\sqrt{s} > 2.5$ GeV [24]. The CLAS data are inconsistent
with those collected in the sixties and seventies [25]. Similar discrepancies were found in other
pseudoscalar meson photoproduction channels [25]. For the analysis presented here, we have used a
subset of the recent CLAS data for which $2.6~\text{GeV} < \sqrt{s} < 2.84~\text{GeV}$ and kaon
center-of-momentum angle $\cos\theta^*_K > 0.35$ to constrain the reggeized background model. By
means of a Bayesian analysis, analogous to the procedure described in Ref. [20], we have
determined the optimal background model variant, which is dubbed Regge-2011. This background
model features rotating phases for both trajectories, and positive tensor and vector coupling
strengths for the $K^{*+}$ trajectory. The RPR amplitude is constructed from this background model
by adding a set of $s$-channel contributions.

The challenge at hand is to determine which set of resonances gives rise to the most probable
RPR model variant $M$, given the world's $p(\gamma, K^+)\Lambda$ data of the last decade,
$\{d_k\}$. The data include 3455 differential cross sections, 2241 single, and 452 double
polarization observables [9, 24, 26, 27]. The set of resonances can be determined by evaluating
the conditional probability $P(M|\{d_k\}) = P(\{d_k\}|M)\,P(M)/P(\{d_k\})$ [20, 28] for each model
variant $M$. The factor $P(\{d_k\}|M)$ is known as the Bayesian evidence $Z$. The probability ratio
of two different models $M_A$ and $M_B$, given the data set $\{d_k\}$, can be expressed as

$$
\frac{P(M_A|\{d_k\})}{P(M_B|\{d_k\})}
= \frac{P(\{d_k\}|M_A)}{P(\{d_k\}|M_B)}\,\frac{P(M_A)}{P(M_B)}
= \frac{Z_A}{Z_B}\,\frac{P(M_A)}{P(M_B)}. \tag{1}
$$

As we have no prior preference for any of the models, the factor $P(M_A)/P(M_B)$ is set to one,
and the probability ratio of Eq. (1) can be expressed in terms of the evidence ratio (or Bayes
factor) $Z_A/Z_B$. The evidence is calculated by marginalizing over the model's parameters
$\alpha_M$ [20, 29],

$$
Z = \int P(\{d_k\}, \alpha_M | M)\, d\alpha_M
  = \int \mathcal{L}(\alpha_M)\, \pi(\alpha_M)\, d\alpha_M. \tag{2}
$$

The prior distribution $\pi(\alpha_M) = P(\alpha_M|M)$ is chosen to be a uniform distribution [28]
between $-100$ and $+100$ for the coupling strengths. This choice is motivated by naturalness
arguments: coupling strengths of 100 give rise to a total cross section exceeding 25 $\mu$b,
thereby overshooting the measured $p(\gamma, K^+)\Lambda$ cross section several times over.
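
As a minimal illustration of Eq. (2), the sketch below evaluates the evidence of a toy one-parameter model with a uniform prior on $[-100, +100]$. The Gaussian-shaped likelihood, the function names `log_likelihood` and `evidence_uniform_prior`, and the midpoint-rule integration are assumptions made for illustration only; they stand in for neither the RPR likelihood nor the integration method used in this work.

```python
import math

def log_likelihood(alpha, best=3.0, width=0.5):
    """Toy log-likelihood: a Gaussian peak in a single coupling strength (hypothetical)."""
    return -0.5 * ((alpha - best) / width) ** 2

def evidence_uniform_prior(log_like, lo=-100.0, hi=100.0, n=200_000):
    """Midpoint-rule estimate of Z = integral of L(alpha) * pi(alpha) d(alpha)."""
    prior = 1.0 / (hi - lo)            # uniform prior density pi(alpha)
    step = (hi - lo) / n
    total = 0.0
    for i in range(n):
        alpha = lo + (i + 0.5) * step  # midpoint of the i-th integration cell
        total += math.exp(log_like(alpha)) * prior * step
    return total

z = evidence_uniform_prior(log_likelihood)
# Analytic result for this toy case: width * sqrt(2*pi) / (hi - lo).
z_exact = 0.5 * math.sqrt(2.0 * math.pi) / 200.0
print(z, z_exact)
```

For a likelihood that is sharply peaked well inside the prior range, the evidence scales inversely with the prior width, which is how each additional parameter is penalized unless it improves the fit sufficiently.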



The likelihood function $\mathcal{L}(\alpha_M) = P(\{d_k\}|\alpha_M, M)$ is parametrized by a
chi-square distribution. In evaluating $\mathcal{L}(\alpha_M)$, the customary estimate of the total
squared error of a data point is the sum of the squared systematic and statistical errors:
$\sigma^2_{\text{tot}} = \sigma^2_{\text{stat}} + \sigma^2_{\text{sys}}$. Due to the non-Gaussian and
correlated nature of the systematic errors, the use of a chi-square distribution underestimates
the real errors and the resulting evidences $Z$ [30]. The total error is underestimated by
$\sqrt{2}$ if the two errors are equal. Furthermore, there are generally at least two different
sources of systematic errors, which are also added quadratically. A more conservative
calculation, where the errors are added linearly, yields as a total error

$$
\sigma'_{\text{tot}} = \sigma_{\text{stat}} + \sigma'_{\text{sys}}
\approx \sigma_{\text{stat}} + \sqrt{2}\,\sigma_{\text{sys}}
\approx \frac{1 + \sqrt{2}}{\sqrt{2}}\,\sigma_{\text{tot}}. \tag{3}
$$
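
The arithmetic behind Eq. (3) can be checked with a short sketch. It assumes the illustrative case in which the statistical and systematic errors are equal and the systematic error is built in quadrature from two equal sources; the function names and the unit error values are hypothetical.

```python
import math

def total_error_quadrature(stat, sys_sources):
    """Customary total error: all contributions added in quadrature."""
    return math.sqrt(stat ** 2 + sum(s ** 2 for s in sys_sources))

def total_error_linear(stat, sys_sources):
    """Conservative total error: all contributions added linearly."""
    return stat + sum(sys_sources)

# Assumed example: sigma_stat = sigma_sys = 1, where sigma_sys is the
# quadratic sum of two equal systematic sources of size 1/sqrt(2) each.
stat = 1.0
sys_sources = [1.0 / math.sqrt(2.0)] * 2

sigma_tot = total_error_quadrature(stat, sys_sources)      # sqrt(2)
sigma_tot_linear = total_error_linear(stat, sys_sources)   # 1 + sqrt(2)
c = sigma_tot_linear / sigma_tot                           # rescaling factor
print(c, (1.0 + math.sqrt(2.0)) / math.sqrt(2.0))
```

Under these assumptions the rescaling factor is $(1+\sqrt{2})/\sqrt{2} \approx 1.71$.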

The replacement $\sigma_{\text{tot}} \to \sigma'_{\text{tot}}$ boils down to rescaling the errors in
the chi-square distribution with $c = (1 + \sqrt{2})/\sqrt{2}$. The bulk of $Z$ is determined by
$\max\{\mathcal{L}(\alpha_M)\}$, so one can correct for this underestimate by considering the
scaling behaviour of the chi-square distribution at $\chi^2_{\min}$ if the error is multiplied by
$c$. The following relation holds,

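$$
\ln\frac{\mathcal{L}(\alpha_M)}{\mathcal{L}'(\alpha_M)}
= (k - 2)\ln c - k\,\chi^2_R(\alpha_M)\,\frac{c^2 - 1}{2c^2}, \tag{4}
$$

where $\mathcal{L}'(\alpha_M)$ is the chi-square distribution for which the errors have been
multiplied by $c$, and $\chi^2_R(\alpha_M)$ is the reduced chi-square value. $k$ is the number of
degrees of freedom of the chi-square distribution, i.e. the number of data points minus the
number of free parameters. This results in the following correction for the computed evidence
$Z$ of a model,

$$
\ln Z' \approx \ln Z - (k - 2)\ln c + k\,\chi^2_{R,\min}\,\frac{c^2 - 1}{2c^2}, \tag{5}
$$

where $\chi^2_{R,\min}$ is the model's minimum reduced chi-square value.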
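
The scaling behaviour used for this correction can be verified numerically. The sketch below assumes that the likelihood is the chi-square probability density with $k$ degrees of freedom evaluated at the total $\chi^2$ (that is, $k$ times the reduced value), and that multiplying all errors by $c$ maps $\chi^2 \to \chi^2/c^2$. The values of `k`, `chi2_reduced` and `log_z` are invented for illustration.

```python
import math

def log_chi2_pdf(x, k):
    """Logarithm of the chi-square probability density with k degrees of freedom."""
    return ((0.5 * k - 1.0) * math.log(x) - 0.5 * x
            - 0.5 * k * math.log(2.0) - math.lgamma(0.5 * k))

def log_ratio_closed_form(chi2, k, c):
    """ln[L / L'] when every error is multiplied by c, i.e. chi2 -> chi2 / c**2."""
    return (k - 2.0) * math.log(c) - chi2 * (c ** 2 - 1.0) / (2.0 * c ** 2)

def corrected_log_evidence(log_z, chi2_min, k, c):
    """ln Z' obtained by shifting ln Z with the scaling of the peak likelihood."""
    return log_z - log_ratio_closed_form(chi2_min, k, c)

k = 6000                      # hypothetical number of degrees of freedom
chi2_reduced = 3.0            # hypothetical minimum reduced chi-square
chi2 = k * chi2_reduced       # total chi-square
c = (1.0 + math.sqrt(2.0)) / math.sqrt(2.0)

direct = log_chi2_pdf(chi2, k) - log_chi2_pdf(chi2 / c ** 2, k)
closed = log_ratio_closed_form(chi2, k, c)
print(direct, closed)

log_z = -9000.0               # hypothetical uncorrected ln Z
print(corrected_log_evidence(log_z, chi2, k, c))
```

The direct evaluation and the closed form agree to numerical precision, so the correction can be applied to an already computed $\ln Z$ without repeating the integration.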

Jeffreys' scale [31] associates the logarithm of the evidence ratio of Eq. (1) with a qualitative
statement on the relative probabilities for two models. It states that a value of
$\Delta \ln Z = \ln(Z_A/Z_B) > 1$ corresponds with significant evidence in favor of the more
probable model, whereas a value smaller than 1 is barely worth mentioning. A value larger than
2.5 is strong to very strong, and a value of 5 and larger is decisive. This scale is employed to
decide which resonance set describes the $p(\gamma, K^+)\Lambda$ data best with the RPR model.
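
The thresholds quoted above translate directly into a small lookup. This is a minimal sketch: the function names are hypothetical, and the treatment of values lying exactly on the boundaries 1 and 2.5 is an assumption, since the text does not specify it.

```python
import math

def jeffreys_category(delta_ln_z):
    """Qualitative reading of |Delta ln Z| using the thresholds quoted in the text."""
    x = abs(delta_ln_z)
    if x < 1.0:
        return "barely worth mentioning"
    if x <= 2.5:          # boundary handling is an assumption
        return "significant"
    if x < 5.0:
        return "strong to very strong"
    return "decisive"

def posterior_odds(delta_ln_z, prior_odds=1.0):
    """Probability ratio of Eq. (1): Bayes factor times the prior odds."""
    return math.exp(delta_ln_z) * prior_odds

for value in (0.4, 2.3, 3.0, 5.0):
    print(value, jeffreys_category(value), posterior_odds(value))
```

With equal prior probabilities, a difference of $\Delta \ln Z = 2.3$ corresponds to posterior odds of roughly 10 to 1.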

The numerical evaluation of Eq. (2) is very cumbersome, as the bulk of the likelihood
$\mathcal{L}(\alpha_M)$ is usually concentrated in a very small region of the multidimensional
parameter space. Therefore, we adopt a numerical procedure consisting of three steps. First, we
employ a genetic algorithm to locate the global optimum in the likelihood hypersurface. Next, the
covariance matrix about the optimum is determined using the MINOS routine in ROOT's MINUIT
package [32]. Finally, the VEGAS algorithm [33] is adopted to calculate the evidence integral
within the error boundaries determined by MINOS. This localized integration yields a first
estimate for the evidences $Z$. As we are dealing with 6148 data points, the likelihood is apt to
be unimodal and peaked in a small region in parameter space. Moreover, the chi-square
distribution $\mathcal{L}(\alpha_M)$ falls off very steeply with increasing $\chi^2$. Therefore, the
volume about the global maximum in the likelihood surface effectively represents the bulk of the
integral.
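
The idea of the final, localized integration step can be sketched as follows. This is not the procedure of this work: plain Monte Carlo sampling is swapped in for VEGAS, the optimum and the parameter errors are assumed to be known already (rather than obtained from a genetic algorithm and MINOS), and the two-parameter Gaussian likelihood, the `n_sigma` box size and all function names are hypothetical.

```python
import math
import random

BEST = (3.0, -7.0)     # assumed location of the likelihood optimum
SIGMA = (0.05, 0.02)   # assumed one-sigma parameter errors at the optimum

def log_like(alpha):
    """Toy two-parameter log-likelihood with a narrow peak (hypothetical)."""
    return -0.5 * sum(((a - b) / s) ** 2 for a, b, s in zip(alpha, BEST, SIGMA))

def localized_evidence(log_like, optimum, errors, n_sigma=6.0,
                       prior_width=200.0, samples=200_000, seed=1):
    """Plain Monte Carlo estimate of Z restricted to a box around the optimum."""
    rng = random.Random(seed)
    lo = [o - n_sigma * e for o, e in zip(optimum, errors)]
    hi = [o + n_sigma * e for o, e in zip(optimum, errors)]
    volume = 1.0
    for a, b in zip(lo, hi):
        volume *= b - a
    prior = prior_width ** (-len(optimum))   # uniform prior density on [-100, 100]^d
    acc = 0.0
    for _ in range(samples):
        point = [rng.uniform(a, b) for a, b in zip(lo, hi)]
        acc += math.exp(log_like(point))
    return prior * volume * acc / samples

z_local = localized_evidence(log_like, BEST, SIGMA)
# Analytic value for the toy Gaussian: 2*pi*sigma_1*sigma_2 / prior_width**2.
exact = 2.0 * math.pi * SIGMA[0] * SIGMA[1] / 200.0 ** 2
print(z_local, exact)
```

Sampling the full prior box uniformly would almost never hit such a narrow peak, which is why the integration is restricted to the neighbourhood of the optimum.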

We have evaluated the evidence integral $Z'$ for all model variants corresponding to combinations
of the nucleon resonances listed in Table I. We consider established resonances, for which
substantial experimental evidence exists: $S_{11}(1535)$, $S_{11}(1650)$, $D_{15}(1675)$,
$F_{15}(1680)$, $D_{13}(1700)$, $P_{11}(1710)$, and $P_{13}(1720)$. The less-established
$P_{13}(1900)$ [5, 7] and $F_{15}(2000)$ resonances are also included. The $P_{11}(1440)$ and
$D_{13}(1520)$, which have masses significantly below threshold, are not considered in our
single-channel formalism. We include the "missing" $D_{13}(1900)$ and $P_{11}(1900)$ resonances.
Both have been identified by at least one analysis as contributing to the $K^+\Lambda$ channel
[5-8]. In a single-channel reaction model it is customary to introduce $N^*$ propagators with a
single pole in the complex plane. Thereby, the dynamical origin of the $N^*$ [14] is approximated
by an effective mass and width. We have adopted the PDG values for the Breit-Wigner masses and
widths if available.

We consider all possible combinations of the 11 proposed resonances, which yields 2048 model
variants. In Fig. 1, the computed evidence values $-\ln Z'$ are displayed in the evidence map of
the RPR model space. The number of free parameters per resonance is 1 for spin-1/2 and 2 for
higher-spin resonances. Points in the same column represent model variants that have the same
number of free parameters, but have a different selection of resonances. This figure illustrates
that increasing the number of $N^*$ parameters does not necessarily result in an improved
evidence. Including $N^*$s does, however, improve $\ln Z'$ by almost two orders of magnitude.
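
The size of the model space follows from simple counting, as the sketch below shows. The spin assignments are read off from the $L_{2I,2J}$ labels, and the parameter rule is the one stated above (1 for spin-1/2, 2 for higher spin). The function names are hypothetical, and the common form-factor cut-off is not counted here.

```python
from itertools import combinations

# (label, spin J) of the 11 candidate resonances considered in the text.
CANDIDATES = [
    ("S11(1535)", 0.5), ("S11(1650)", 0.5), ("D15(1675)", 2.5),
    ("F15(1680)", 2.5), ("D13(1700)", 1.5), ("P11(1710)", 0.5),
    ("P13(1720)", 1.5), ("D13(1900)", 1.5), ("P13(1900)", 1.5),
    ("P11(1900)", 0.5), ("F15(2000)", 2.5),
]

def coupling_parameters(model):
    """Number of coupling parameters: 1 per spin-1/2 state, 2 per higher-spin state."""
    return sum(1 if spin == 0.5 else 2 for _, spin in model)

def all_model_variants(candidates):
    """Yield every subset of the candidate list, including the empty set."""
    for size in range(len(candidates) + 1):
        for subset in combinations(candidates, size):
            yield subset

variants = list(all_model_variants(CANDIDATES))
print(len(variants))                     # 2**11 = 2048
print(coupling_parameters(CANDIDATES))   # couplings when all 11 states are included
```

The empty subset corresponds to the background-only model, so the 2048 variants include the case without any $N^*$.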

The model with the highest evidence value has 14 $N^*$ parameters and features the resonances
$S_{11}(1535)$, $S_{11}(1650)$, $F_{15}(1680)$, $P_{13}(1720)$, $D_{13}(1900)$, $P_{13}(1900)$,
$P_{11}(1900)$, and $F_{15}(2000)$. This model is dubbed RPR-2011. A comparison with the
second-best model, which has 12 $N^*$ parameters and does not feature the missing $D_{13}(1900)$
resonance, yields a difference of $\Delta \ln Z = 2.3$, corresponding with significant to strong
evidence in favor of the RPR-2011 model.

A selection of the $p(\gamma, K^+)\Lambda$ differential cross sections and recoil polarizations $P$
as calculated by the RPR-2011 model and the Regge-2011 background model are presented in Fig. 2.
The figure illustrates that while the differential cross section is dominated by the background
amplitude, polarization observables can be highly sensitive to $N^*$ contributions. The highest
sensitivity to the $N^*$ contributions can be observed at backward kaon angles $\theta^*_K$.

Figure 1. (color online). The evidence values of the 2048 model variants in the RPR model space
(blue circles), as a function of the number of free $N^*$ parameters. The smaller the value of
$-\ln Z'$, the higher the evidence. The best model for a fixed number of parameters is indicated
with a red square, the overall best model (RPR-2011) with an open circle. Top right inset:
evidence ratios relative to RPR-2011, $-\ln(Z'/Z'_{\text{RPR-2011}})$, for the models with the
lowest $-\ln Z'$. The color coding refers to Jeffreys' scale: barely worth mentioning (orange),
significant (yellow), strong to very strong (green) and decisive (white).

We use the 2048 evidences of Fig. 1 to determine the conditional probability of the individual
resonances, given the 6148 measured $p(\gamma, K^+)\Lambda$ observables,



$$
P(R|\{d_k\}) = \sum_{M_i \,|\, R \in M_i} P(\{d_k\}|M_i)\,\frac{P(M_i)}{P(\{d_k\})}, \tag{6}
$$



where the summation includes the $n$ model variants $M_i$ containing $R$. The second factor on
the right-hand side, $P(M_i)/P(\{d_k\})$, is equal for all models $M_i$, so it drops out of the
probability ratio $P(R_1|\{d_k\})/P(R_2|\{d_k\})$ of two resonances.
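
A minimal sketch of how Eq. (6) can be evaluated from a table of log-evidences is given below, taking the factor $P(M_i)/P(\{d_k\})$ equal to $1/n$ so that the sum becomes an average over the models containing $R$. Because evidences span many orders of magnitude, the average is taken in log space. The three resonance labels, the invented $\ln Z'$ values and the function names are all hypothetical.

```python
import math
from itertools import combinations

def log_mean_exp(values):
    """Numerically stable ln of the average of exp(v) over the given values."""
    m = max(values)
    return m + math.log(sum(math.exp(v - m) for v in values) / len(values))

def log_resonance_probability(resonance, log_evidence):
    """ln P(R|{d_k}) from Eq. (6), with P(M_i)/P({d_k}) set to 1/n."""
    selected = [lz for model, lz in log_evidence.items() if resonance in model]
    return log_mean_exp(selected)

# Toy model space: every subset of three hypothetical resonances "A", "B", "C".
# Invented ln Z' values: including "A" gains 50 units, and every included
# resonance costs 1 unit.
names = ("A", "B", "C")
toy_log_evidence = {}
for size in range(len(names) + 1):
    for subset in combinations(names, size):
        toy_log_evidence[frozenset(subset)] = (
            -1000.0 + 50.0 * ("A" in subset) - 1.0 * len(subset)
        )

for name in names:
    print(name, log_resonance_probability(name, toy_log_evidence))
```

In this toy example the resonance "A" obtains the largest $\ln P(R|\{d_k\})$, while "B" and "C" are indistinguishable, as expected from the way the evidences were constructed.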

The results of Eq. (6) with $P(M_i)/P(\{d_k\})$ set equal to $1/n$ are shown in Fig. 3. This
reveals that the resonances which have the highest probability of contributing to
$p(\gamma, K^+)\Lambda$ are those that constitute the resonance set of RPR-2011. This set features
the three resonances with a mass around 1900 MeV that are predicted by constituent quark models
[1], but not by quark-diquark models [2]. Moreover, we find no significant contribution of the
$P_{11}(1710)$ resonance to the $p(\gamma, K^+)\Lambda$ reaction. In the latest SAID analyses
[11, 39], this resonance was not needed for the description of $\pi N$ scattering either. The
$P_{11}(1710)$'s negligible coupling to $\pi N$ and its absence in reactions where the $\pi\pi N$
channels are not relevant can be attributed to it being a resonance in the $\pi\pi N$ system [40].

Figure 2. (color online). The angular dependence of the $p(\gamma, K^+)\Lambda$ differential cross
section and recoil polarization $P$ at various $\omega_{\text{lab}}$. The full red line represents
the RPR-2011 model and the blue dashed line corresponds to the reggeized background model
Regge-2011. Data are from Refs. [24, 34-38].

Summarizing, we have addressed the issue of investigating the resonance content of
$p(\gamma, K^+)\Lambda$. This channel is known to receive large non-resonant contributions, which
complicates the extraction of $N^*$ information. The non-resonant dynamics can be effectively
handled in a Regge formalism with a mere three parameters and two phases. The efficiency of the
RPR model has enabled us to perform a Bayesian analysis. From a proposed set of 11 $N^*$s, we have
identified the 8 $N^*$s with a high conditional probability of contributing to
$p(\gamma, K^+)\Lambda$.

Bayesian inference has the power to reduce the bias in identifying the $N^*$ content in more
advanced reaction models, but the curse of dimensionality poses computational hurdles that
cannot be overcome to date.

This research was financed by the Flemish Research Foundation (FWO Vlaanderen). The calculations
were carried out on the Stevin Supercomputer Infrastructure at Ghent University, funded by Ghent
University, the Hercules Foundation and the Flemish Government - department EWI.

Figure 3. The relative resonance probabilities $-\ln P(R|\{d_k\})$ of Eq. (6), computed from the
evidences $Z'$ of Eq. (5), with $P(M_i)/P(\{d_k\}) = 1/n$. The three resonances that stand out are
the ones that are not included in RPR-2011.